Skip to content

Conversation

@jcrist
Copy link
Contributor

@jcrist jcrist commented Jan 9, 2018

One race condition was already fixed, but another one exists when
writing by partition.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It might be worth adding exist_ok to the filesystem interface. Python has this kwarg for os.makedirs (but not os.mkdir) to say not to error if the directory already exists (defaults to False).

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Before adding more things to the interface, I would prefer that we continue to settle the discussion on a uniform file interface between dask, Arrow and others. Then we actually know to what we all want to converge to.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe worth opening a JIRA about this in either case

One race condition was already fixed, but another one exists when
writing by partition.
@jcrist jcrist force-pushed the fix-race-condition-parquet-partition branch from 64d9bff to 180c4a2 Compare January 9, 2018 23:39
@xhochy xhochy changed the title Arrow-1980: [Python] Fix race condition in write_to_dataset ARROW-1980: [Python] Fix race condition in write_to_dataset Jan 10, 2018
@wesm wesm closed this in d0a93ba Jan 10, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants